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INTRODUCTION 


Consider  a  production  function  =  g(x^,3)  + 
i=l,2,...,N,  where  y^  =  output  for  observation  i, 
xi  =  vector  of  inputs  for  observation  i,  3  =  vector 
of  parameters,  error  term  for  observation  i.  The 
"stochastic  frontier"  (also  called  "composed  error") 
model,  introduced  by  Aigner,  Lovell,  and  Schmidt  (1977) 
and  Meeusen  and  van  den  Broeck  (1977),  postulates  that 
the  error  term  is  made  up  of  two  independent 
components: 


2 

where  v^  ~N(0,<rv)  is  a  two-sided  error  term 
representing  the  usual  statistical  noise  found  in  any 
relationship,  and  u(  >  0  is  a  one-sided  error  term 
representing  technical  inefficiency.  Note  tht  u^ 
measures  technical  inefficiency  in  the  sense  that  it 
measures  the  shortfall  of  output  (yi)  from  its 
maximal  possible  value  (g(x  j, ,  3  )+Vi  ) . 


When  a  model  of  this  form  is  estimated,  one  readily 
obtains  residuals  =  y^  -  gfx^/f^),  which  can  be  f /)i? 

regarded  as  estimates  of  error  terms  However, 


the  problem  of  decomposing  such  an  estimate  into 

separate  estimates  of  the  components  v^  and  ui  has//.,, 
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remained  unsolved  for  some  time.  Of  course,  the 
average  technical  inefficiency — the  mean  of  the 
distribution  of  the  u^ — is  easily  calculated.  For 
example,  in  the  half-normal  case  (u^  distributed  as 

2 

the  absolute  value  of  a  N(0,ffu)  variable).  The  mean 
technical  inefficiency  is  and  this  can  be 

evaluated  given  one's  estimate  of  <ru,  as  in  Aigner, 
Lovell,  and  Schmidt  (1977)  or  Schmidt  and  Lovell 
(1979).  Or  average  technical  inefficiency  can  be 
estimated  by  the  average  of  the  e^.  But  it  is  also 
clearly  desirable  to  be  able  to  estimate  the  technical 
inefficiency  u^  for  each  observation. 

Intuitively,  this  should  be  possible  because 
=  vi  -  ui  can  be  estimated  and  obviously  contains 
information  on  u^.  In  this  paper,  we  proceed  by 
considering  the  conditional  distribution  of  u^  given 
«i.  This  distribution  contains  whatever  information 
yields  about  ui>  Our  estimate  of  ui  is  simply 
the  mean  of  this  conditional  distribution,  evaluated  at 
«i  =  '«i.  The  variance  of  the  conditional  distribu¬ 
tion  is  also  calculated,  since  it  indicates  how  close 
realizations  of  the  conditional  distribution  are  likely 
to  be  to  the  conditional  mean.  The  conditional  means 
and  variances  are  calculated  for  the  commonly  assumed 
cases  of  half-normal  and  exponential  u^,  resulting  in 
easily  evaluated  expressions. 
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Replacing  <  by  ^ gives  the  desired  estimate  of  u. 
Incidentally,  it  is  easily  verified  that  the  expres¬ 
sion  in  (4)  is  non-negative,  and  monotonic  in  «. 


The  estimate  of  u  obtained  by  evaluating  (4)  at  « 

contains  two  types  of  variance.  The  first  is 

ordinary  sampling  error,  due  to  the  variability  of 

as  an  estimate  of  «  (which  in  turn  is  due  to  the 
/\ 

variability  of  3  as  an  estimate  of  3).  This 
variability  disappears  asymptotically  and  can, 
therefore,  be  ignored  for  large  enough  sample  sizes. 
However,  the  estimate  in  (4)  still  contains  the 
variance  of  the  conditional  distribution,  which  is 
independent  of  sample  size  and  should  not  be  ignored; 
it  is  a  reflection  of  the  obvious  fact  that  e  con¬ 
tains  only  imperfect  information  about  u.  The 
conditional  variance  is  simply  the  variance  of  a 


variable  truncated  at  zero,  which  is 


Var(u  «  )  = 


i  +  *L  > 

*  l-/f(«A/<r  ) 


r  f(tAA )  i 

[1-F(«XA)J 


(5) 


Thus,  (5)  gives  a  measure  of  the  variability  of  the 
estimate  (4),  for  sample  sizes  large  enough  to  ignore 
the  sampling  error.  Of  course,  since  we  are  dealing 
with  a  truncated  normal  distribution,  we  should  be 
careful  in  our  interpretation  of  standard  errors; 
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confidence  intervals  for  u  might  better  come  from  the 
conditional  distribution  (Theorem  1)  itself  than  from 
consideration  of  u  relative  to  its  standard  error. 

THE  EXPONENTIAL  CASE 

This  case  is  identical  to  the  half-normal  case,  except 
that  now  the  technical  inefficiency  error  term  u  is 
assumed  to  follow  the  one-parameter  exponential 
distribution  with  density 

f  ( u )  =  exp  (“u/au)/cru  (6) 

Our  results  are  similar  to  those  for  the  half-normal 
case.  Define  A  =  «/ov  +crv/au*  Then  we  have  the 
following  result,  proved  in  the  appendix: 

THEOREM  2:  The  conditional  distribution  of  u 

2 

given  e  is  that  of  a  N(-c?vA,  c*v)  variable, 
truncated  at  zero. 

The  mean  and  variance  of  this  distributon  are  therefore 
exactly  as  given  in  (4)  and  (5)  if  we  just  replace  o* 
by  ov  and  replace  «X/a  by  A.  As  in  the  half-normal 
case,  these  would  be  evaluated  at  «  = 


AN  EXAMPLE 

Schmidt  and  Lovell  (1980)  estimated  a  system  consisting 
of  a  stochastic  frontier  production  function  and 


first-order  conditions  for  cost  minimization,  based  on 
a  sample  of  111  steam-electric  generating  plants.  The 
estimates  on  which  our  calculations  are  based  are  those 
reported  in  the  first  column  of  table  1  of  Schmidt  and 

Lovell.  In  particular,  note  that  ^ ^  =  .014452, 

=  .003261,  and  that  the  estimated  average  technical 
inefficiency  (mean  of  u)  is  .09592,  indicating  about 
9.6  percent  technical  inefficiency. 

We  have  calculated  'u  =  E(uJ't')  for  each  observation, 
based  on  (4)  since  estimation  assumed  half-normal  u,  as 
well  as  the  variance  of  this  estimate  based  on  (5).  We 
will  not  present  results  for  all  111  observations,  but 
rather,  point  out  a  few  things  of  interest.  The  mean 
of  the  u’s  is  .08387,  which  is  in  the  same  ballpark  as 
the  .09592  reported  above  and  as  the  mean  of  -.07433 
for  the  ^'s.  The  smallest  ti'  (most  technically  effi¬ 
cient  plant)  was  .01658,  with  a  standard  error  of 
.01534,  based  on  't'  =  .15891.  This  is  a  modest  outlier, 
with  /t'  about  2.75  standard  deviations  from  the  mean 
(The  other  observations  had  u'  under  .02  also.)  The 
most  technically  efficient  observations  can  be 
characterized  as  having  high  outputs  (the  two  most 
efficient  plants  have  the  second  and  third  largest 
outputs  in  the  sample),  low  capital  stocks  and  high 
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levels  of  fuel  consumption  and  labor  usage.  Their 
level  of  allocative  inefficiency  (see  Schmidt  and 


Lovell  (1979))  is  below  average,  though  not  strongly 

/N 

so.  On  the  other  hand,  the  largest  u  (most  technically 
inefficient  plant)  was  .37156,  with  a  standard  error  of 
.05105,  based  on  =  -.45540.  This  is  a  fairly  large 
outlier,  in  the  sense  that  ^ is  almost  4  standard 
deviations  from  the  mean  ^  j  but  one  other  observation 
had  an  almost  identical  value  of  'u'  and'*'.  The  most 

i 

technically  inefficient  observations  have  rather 
average  outputs  but  higher  than  average  uses  of 
capital,  fuel,  and  labor.  They  also  had  slightly  above 
average  levels  of  allocative  inefficiency. 


CONCLUSIONS 


'^In  this  paper,  we-  hev-e  -proposed  a  method  separating 

A 

the  error  term  of  the  stochastic  frontier  model  into 
its  two  components  for  each  observation.  This^enables 
one  to  estimate  the  level  of  technical  inefficiency  for 
each  observation  in  the  sample,  and  largely  removes 
what  had  been  viewed  as  a  considerable  disadvantage  of 
the  stochastic  frontier  model  relative  to  other  models 
(so-called  deterministic  frontiers)  for  which  technical 
inefficiency  was  readily  measured  for  each  observation. 


APPENDIX 


THE  HALF-NORMAL  CASE 

o 

In  the  half-normal  case,  v  ~  N(0,ffv),  u  is  distributed 

9 

as  the  absolute  value  of  N(0,<xu),  v  and  u  are 
independent,  and  «  =  v-u.  We  wish  to  find  the 
distribution  of  u  conditional  on  e  . 


The  joint  density  of  u  and  v  is  the  product  of  their 
individual  densities,  since  they  are  independent: 


f  (u,v) 


1 


Tffuffv 


exp 


2 


u 


u>0  . 


(Al) 


Making  the  transformation  *  =  v-u,  the  joint  density  of 
u  and  e  is 


f  (u,  < ) 


,r<ruffv 


exp 


(lT+€*  +  2u€  ) 

2'v 


(A2) 


The  density  of  «  is  given  by  equation  (8)  of  Aigner, 
Lovell,  and  Schmidt  (1977): 


f<«)  = 


( 1-F )  exp 


_L_  *2 

2<j2 


(A3) 
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2  2  2 

where  *  "<ru+<7v'  °t/°v'  and  F  *s  t^ie  standard 

normal  cdf,  evaluated  at  «X/c.  Therefore,  the 
conditional  density  of  u  given  «  is  the  ratio  of  (A2) 
to  (A3),  which  we  can  write  as 


f  (u  |«) 


(A4) 


2  2  2  2 
where  oj  =  o*a*/o 


With  a  little  algebra. 


£(u  l‘>  "  exp 


-4  (u+a2«/a2)2 
2  a% 


this  simplifies  to 
,  u>0.  (A5) 


Except  for  the  term  involving  1-F,  this  looks  like  the 

2  2 

density  of  with  m*  =  -oui/a*.  Finally,  note 

that  F  is  evaluated  at  «X /o  =  and  thus  (1-F)  is  just 

2 

the  probability  that  a  N(m*,o*)  variable  be  positive. 

2 

Thus,  ( A5)  is  indeed  the  density  of  a  N(m*,o*)  variable 
truncated  at  zero. 


The  Exponential  Case 

2 

In  the  exponential  case,  v  —  N(0,au)  while  u  is  independent 
of  v  and  has  the  one-parameter  exponential  density 

f(u)  “  ^  exP  •  u^-°  (A6) 
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with  «=  v-u,  the  joint  density  of  u  and*  ,  becomes 


f(u,«)  = 


exp 


1  2  /  <  .  l\  1 

2o2  U  \o2  ou)  U<  '  2cj2  ‘ 

V  \  V  /  V 


From  Aigner,  Lovell,  and  Schmidt  (1977,  p.  29),  the  density 
of  t  is 


1-F 

f  (  «  )  =  -  exp 


°u  2a' 


( A8 ) 


where  F  is  again  the  standard  normal  cdf,  but  evaluated  at 


A  =  «  /a  +  a  /  a 
v  v'  u 


(A9) 


Taking  the  ratio  of  (A7)  to  (A8)  and  simplifying, 

f(u|«  )  =  — r  ^2^-  exp  “  ^  +  0yA  ^  |  ,  u>0.  (A10) 

.  .  .  2 
But  this  is  just  the  density  of  a  N(-avA,  a )  variable, 

truncated  at  zero. 


( A7 ) 
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